RAP

Reproducible Analytcal Pipelines

Jake Tufts

What is “RAP”?

🎤

Why a pipeline?

Clean Data Pipeline

Realistic Data Pipeline

So… How do we deal with this?

Become a…

RAP God

You’re likely already implementing RAP

(or a variation of it)

The Reproducibilty Iceberg

An expanding brain meme styled iceberg showing the development of RAP.

Why should you RAP?



What happens if you don’t?

  1. Messy files and folders
  1. Confusing names
  1. Lost knowledge
  1. No papertrail, rollback or accountability
  1. Bugs and errors
  1. Duplicated work and low efficiency

Why should you RAP?

An expanding brain meme styled iceberg showing the development of RAP.

When they find out you RAP


It will make your life
easier

Your work will be of
higher quality

Work should be more
enjoyable
and even
fun!

Simple steps to achieve Reproducibility



  1. Tidy files and folders 🧹
  1. Effective naming 👨‍👩‍👦‍👦
  1. Document like you ctrl S ✍️
  1. Version control 🛠️
  1. Software stability 🤸
  1. Share, publish/open code, results… 🌎

Where to start?


Implement the 6 steps 👟

Checkout the DfE’s RAP strategy 🎲

Ask me for help! 🙋

References


Seibold. H., (10th Oct 2023). RAP webinar October 2023. Health Data Research UK. Available YouTubeat: https://www.youtube.com/watch?v=C27ubS0AioA. (Accessed: 20 December 2023)

Rodrigues. B., (October 3, 2023). Building reproducible analytical pipelines with R. Available at: https://raps-with-r.dev/ (Accessed: 20 December 2023)